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ABSTRACT 

We examine the correlation function £ of the Sloan Digital Sky Survey (SDSS) 
Luminous Red Galaxy sample (LRG) at large scales (60 < s < 400 /i _1 Mpc) using the 
final data release (DR7; 105,831 LRGs between 0.16 < z < 0.47). Using mock catalogs, 
we demonstrate that the observed baryonic acoustic peak and larger scale signal are 
consistent with ACDM at the 1.5a level. The signal at 155 < s < 200/i~ 1 Mpc tends to 
be high relative to theoretical expectations; this slight deviation can be attributed to 
a bright subsample of the LRGs. Fitting data to a non-linear, redshift-space, template 
based-model, we constrain the peak position at s p = 103. 6I2 '4 h~ 1 Mpc when fitting the 
range 60 < s < 150 /i _1 Mpc (la uncertainties measured from the mocks). This redshift- 
space distance s p is related to the comoving sound horizon scale r s after taking into 
account matter clustering non-linearities, redshift distortions and galaxy clustering bias. 
Mock catalogs show that the probability that a DR7-sized sample would not have an 



identifiable peak is at least ~ 10%. As a consistency check of a fiducial cosmology, we 
use the observed s p to obtain the distance Dy = [(1 + z) 2 D\cz/H{z)] s relative to the 
acoustic scale. We find rjDy j z = 0. 278) = 0.1394± 0.0049. This result is in excellent 



agreement with iPercival et al.l ((2.009), who examine roughly the same data set, but 
using the power spectrum. Comparison with other determinations in the literature are 
also in very good agreement. We have tested our results against a battery of possible 
systematic effects, finding all effects are smaller than our estimated sample variance. 



Subject headings: cD: large-scale structure of universe, cosmology: observation, distance 
scale: baryonic acoustic feature, galaxies: elliptical and lenticular 



1. Introduction 



The large scale clustering of matter is a critical test of any cosmological model. ACDM models 
predict a matter auto-correlation-function £ that crosses over from positive correlations to negative 
correlations at scales of ~ 140 /i _1 Mpc. An interesting feature in £ is the baryonic acoustic peak, 
a residual from plasma sound-waves that came to a near stop at the end of the baryon drag epoch 
(zd ~ 1010). The baryonic acoustic signature is strongly imprinted in the Cosmic Microwave 
Background (CMB ) temperature fluctu ations and observations determine its physical size to high 



precision (< 1.3%; ISpergel et al.ll2003l ). Predicted to appear in galaxy clustering measurements, 



the baryonic acoustic feature can in principle provide determ ination of cosmic distances with very 
small statistical uncertainties (< l%: ISeo &: Eisensteinl 120071 ). 



The feature in the galaxy £ was first measured by lEisenstein et al.l (120051) u sing Luminous Red 
Galaxies (LRG) from the Sloan Digital Sky Survey fSDSS: hfork et alJbood ). Using ~ 47,000 
LRGs from the third data release (DR3), they determined the distance to z ~ 0.35 at 5% accuracy 
and constrained cosmological parameters. Similar measurements using later data releases detected 



a peak, but as we discuss below, a bro ader one (jCabre fc Gaztahagal |2009| ; iMartinez et al. 



2008 



Labini et al.ll2009l : ISanchez et al.ll2009l ). This apparent peculiarity may raise concerns about the 
utility of the baryonic acoustic peak in distance determination and the consistency of ACDM models 
in large-scale galaxy clustering. 

Padmanabhan et al.l (j2007i ) (DR3: lAbazajian et alJbood ) and blake et all (j2007h (DR4: lAdelman-McCarthv et 



2006) showed in their photo-z analysis of the power spectrum of the LRGs a hint of excess power at 
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large scales. iBlake et al.l (|2007l ) suggest a few possible causes for this excess power: residual system- 
atic errors, cosmic variance , large-scale galaxy biasing mechanisms, and new early Universe physics. 
Cabre Gaztahagal (|2009l ) used Data Release 6 (DR6; lAdelman-McCarthv k, Sloan Digital Sky Survey 
2008 ; ~ 75, 000 LRGs, ~ 1 /i~ 3 Gpc 3 in comoving-volume) to probe for possible systematics in data 
analysis, showing that the strong large-scale signal at s > 130 fe -1 Mpc is persis tent for various 
choices of weighting schemes and galaxy sample se lection. [M artine z et al.l (120081) confirmed th is 
stable, but wider b aryonic acoustic feature in DR7 (jAbazajian Sloan Digital Sky Survey 120091 ) . 
Labini et al.l (120091 ) reanalyzed this last sample, pointing out that there was no obvious cross-over 
to anti-correlations at the predicted scale of r c ~ 140 /i _1 Mpc or even at 250/i~ 1 Mpc. 

Other studies that investigate large-scale cluster ing and the baryo nic acou stic feature of th e 
SPSS LRGs and the 2dF Galaxy Redshift Survey Icolless et alllgOOah include ICole et al.1 (hood). 



Reid et al 



Tegmark et"al1 (hoOfil ) . Ipercival et all (|2007l ). [s^nchez et al.1 d2009l ) . IPercival et al.1 (|2009l ). and 
([20091 ). 

The purpose of this study is to measure the LRG two-point correlation function on large scales, 
compare it with the predictions of the ACDM model, and derive constraints on the baryon acoustic 
oscillation scale. We obtain precise uncertainty estimates on ^ by using a large suite of mock 
catalogs drawn from N-body simulations. In the course of our study, we examine the differences 
in £ at large scales among the different SDSS data releases; we show that the differences between 
the DR3 and DR7 (the latter contains ~ 105,000 LRGs, ~ 1.6/i~ 3 Gpc 3 ) results do not arise from 
known systematics in the data analysis. Further, we demonstrate that the stronger large-scale signal 
in the final data release is consistent with the ACDM framework. We also analyze the position of 
the baryonic acoustic feature, explaining systematics, and relate our measurement to the cosmic 
distance Dy. 

In Sj2]we discuss the SDSS data set. In \ 3] we explain our metho ds and mock catalogs. In §4. II we 
show that we can reproduce results from lEisenstein et al.l (120051 ) using their same sample (DR3). 
In §4.21 and §4.31 we apply the same technique to calculate the large-scale £ of DR7 and compare 
to LRG mock catalogs. In §4.21 we also investigate the chances of identifying a baryon acoustic 
peak in an SDSS-sized sample. In §4.41 we test the robustness of the baryonic acoustic feature 
and large-scale signal in £ to systematic errors in calibration, suggesting that SDSS photometric 
calibration errors should not affect the final results. In §4.51 we determine the location of the peak 
of the baryonic acoustic feature, and its uncertainty. In £ )4.6l we use the peak position to determine 
the ratio between the acoustic scale at the drag epoch r s to an effective distance Dy at redshift 
of z = 0.278. In the appendices we discuss technical aspects of the selection function, systematic 
uncertainties and estimators. 

In the following, unless otherwise indicated, all calculations assume a flat ACDM model with 
present day matter density &mo = 0.25 and a present Hubble expansion rate Hq = lOO/i km s -1 
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Mpc . When running mock simulations h = 0.7, but otherwise h = 1 when con verting redshifts 



tocomoving distances. When analyzing the DR3 we use the same cosmology as lEisenstein et al 

fcood ). n M0 = 0.3. 



2. Data 



The SDSS is the largest volume LRG s urvey to date, having imaged the sky at high Galactic 



latitude i n five passbands u,g,r,i and z (jFukugita et al 



(Eis enstein et al. 



2001 



1996 



Gunn et al.l I1998T) using a 2.5m 



telescope (iGunn et ali bood) The images are pro cessed dLupton et alJboOll. Istoughton et aDl2002l. 



Pier et al.lEooi . Ilvezic et al.ll2004l ) and calibrate d faogg et alJboOll.lSmith et ali bood 



2006 ), allowing selecti o n of galaxies, quas ars (jRichards et al.l [2002J) and stars for spectroscopy 



Tucker et al 



Strauss et al.l |2002| ) with twin fiber fed double spectographs. Targets 



are assigned to plu g plates according to a tiling algorithm ensuring nearly complete samples 
dBlanton et al.ll2003h . 



The SDSS LRG sample developed by lEisenstein et al.1 (|200ll ) serves as a good tracer of matter 
as they are associated with massive halos. The LRG high luminosity enables us to obtain a large 
volume, and their spectral uniformity make them relatively easy to identify. 

The SDSS LRG sample covers 19% of the sky and the sky distribution is shown in Figured! The 
sample includes a large quasi-volume-limited region to a redshift of z ~ 0.36, and is flux-limited 
thereafter extending to z ~ 0.47. The peak at z ~ 0.36 is associated with the passage of the 4000A 
break into the r-band. 

From the full sample DR7-Full we subample to answer two main purposes: Comparison with 
previous studies, focus on volume-limited regions. 

To avoid the effects of th e flux limitted region , we focus much of our analysis on subsamples of DR7 
similar to those chosen bv lzehavi etall (bood ). DR7-Dim (0.16 < z < 0.36, -23.2 < M g < -21.2) 
is produced by subsampling DR7-Full at z ^ 0.36. This quasi-volume-limited subsample is not 
a dimmer sample than DR7-Full, but rather is called "Dim" as a moniker to distinguish from 
the overlapping brighter subsample DR7-Bright (0.16 < z < 0.44, -23.2 < M g < -22.8). For 
further tests this last sample was devided into two subsamples DR7-Bright-Near (0.16 < z < 0.36, 
-23.2 < M g < -22.8) and DR7-Bright-Far (0.36 < z < 0.44, -23.2 < M g < -22.8). We also 
included results for another volume limited subsample DR7-Bright2, which is a brighter subsample 
of DR7-Dim (0.16 < z < 0.36, -22.6 < M g < -21.6). 

All subsamples are summarized in Table 1. In addition to various cuts in redshift and ab- 
solute magnitude M g , we also analyze a sample limited to the earlier release DR3 analyzed by 
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Eisenstein et al.1 (|2005l ). 



Figure [2] shows the comoving number density as a function of redshift n{z) for each sample. 
DR7-Full (black) is the full sample, from which we subsample DR3 (cyan). DR3 covers only the 
sky area of DR7-Full which w as also covered by DR3 . For most calculations in DR3 we apply the 



fiducial cosmology chosen by lEisenstein et al.1 (|2005l ). It yields a similar (but noisier) n{z) and 



comoving volume density to the DR7-Full, differing somewhat due to the different cosmologies (flat 
ACDM, with Omo = -25 in DR7 and f2ji/o = 0.3 in DR3). Our subsample agrees in detail with 



Eisenstein et al.l (|2005l ) with only < 350 mismatches out of ~ 47, 000 galaxies (0.7%). For a more 
thorough discussion of LRG selection please see Appendix IA.1I The figure also displays the radial 
selection function n{z) of subsamples DR7-Dim (green), DR7-Bright (blue) and DR7-Bright2 
(red). 

In Appendix IA.2I we show, using mock catalogs, that features in n{z) up to z < 0.36 cause a 
negligible change in the noise properties of the sample relative to a volume-limited one. 

For reasons described in §3.21 we only use the Northern Galactic Cap for subsamples DR7-Dim, 
DR7-Bright. For DR7-Full we make use of both caps, as in DR3. As explained in Appendix IB. 3\ 
we verified that the resulting £ does not change when excluding the Southern Galactic Cap region. 

A physical constraint when obtaining spectra, known as fiber-collisions, is dealt with here when 
analyzing the data. Due to the physical size of the fibers of the spectrometer, one can not obtain 
spectra simultaniously of two targeted objects that reside within 55" of each other. To reduce this 
effect, regions in the sky were overlapped by multiple observing plates. We count ~ 2% of targeted 
LRGs were missed due to fiber-collisions. To account for this we up-weight LRGs with spectra 
which are within 55" proximity of LRGs without. For more details of our method for fiber-collision 
correction, its impact on £ as well as acounting for holes and boundary effects of the survey, please 
refer to Appendix IA.1I 

All data used in this study may be obtained on the World Wide Web@ 



2 http://cosmo.nyu.edu/~eak306/SDSS-LRG.html 
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3. Method 



3.1. Clustering Estimator and Random Points 



To measure the two-point correlation function we use the lLandv &: Szalavl (|1993l ) estimator 



DD{s) - 2DR(s) + RR(s) 
RR(s) 



(1) 



which compares the normalized number of data pair counts to randomly distributed points. The 
quantity s refers to the mean redshift space separation for each bin. If we define r to be the ratio of 
the number of random points to data and Ndd(s) to be the total number of galaxy pairs separated 
by values (s — ds/2,s + ds/2] (where ds is the width of the bin), then the normalized number of 
pairs are DD = Nddt 2 , DR = Ndrt, and RR = Nrr. Here, DR and RR stand for data-random, 
random-random pairs, respectively. The random points account for the effects introduced by survey 
boundary, holes within the data set, and sector incompleteness. 



Additionally, in our counting of pairs, we apply weights to each galaxy. Appendices [A] and IB. 21 
discuss the details of distributing the random points and the pair-count weighting. In Appendix 
IB.3| we also compare thi s estimato r to ot her known methods, showing excellent agreement with 

Hamilton! (119931) on large scales, and s ubstantial differences with those 



the method proposed by _ 

proposed by Davis fc Peebles! ( 1983 ) and Peebles fc Hauser ( 1974 ). 



3.2. LRG Mock Catalogs 

We use mock galaxy catalogs produced from the LasDamas simulations (McBride et al. 2009; 
in prep) to measure the uncertainty covariance matrix, as well as to investigate systematic errors 
in the two-point-correlation estimators. These mock catalogs provide 160 light-cone redshift-space 
realizations of an SDSS-sized volume, with appropriate number densities and clustering properties 
for comparison to the observed data. 

The LasDamas simulations are designed to model the clustering of SDSS DR7 in a wide luminosity 
range. In this suite of simulations galaxies are artificially placed in dark matter halos speci fically 



using the formalism of the halo occupation distribution (HOD; iBerlind &; Weinberg] |2002| ) with 
parameters chosen to match an observational SDSS sample. For complete details see McBride et 
al. (2009, in prep.); for distribution visit the the World Wide WebU 



3 http://lss. phy.vanderbilt.edu/lasdamas/ 
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We use the "gamma" release of mock LRG catalogs produced from simulations. The Oriana 
simulation consists of 40 iV-body dark matter simulations; each realization contains 1280 3 particles 
of mass 45.73 10 10 h~ 1 M.Q in a box of length 2400/i -1 Mpc. The softening parameter is 53/i _1 kpc. 
The simulations assume a flat ACDM cosmology with total matter density FImo = 0.25, baryon 
density f^o = 0.04, Hubble expansion rate Hq = 70 km s Mpc , a% = 0.8, and spectral index 
n s = 1. The HOD parameters are adjusted to reproduce the observed number density as well as 
the projected two-point-correlation function w p (r p ) at projected separations 0.3 < r p < 30 /i Mpc, 
well below the scales we consider here. The choice of HOD affects the resulting £(s) shape on small 
scales, but on large scales HOD pr i marily biases the amplitude as expected from local galaxy bia s 



arguments (|Frv Sz Gaztanaga 



1993 



Coleslll993l ; IScherrer Weinberg 



1998 



Narayanan et alj|2000i ). 



The LasDamas team has divided the sky into two alternative footprints: one can use either the 
SDSS Northern Galactic and Southern Galactic Caps together, or only the Northern. We use the 
latter option because in that case each simulation produces four samples (as opposed to two in the 
former) resulting in twice as many realizations. In summary, we analyze 40 x 4 = 160 LasDamas 
mock catalogs for each of both luminosity subsamples DR7-Bright and DR7-Dim. We note that 
the angular distribution of galaxies in these mocks is similar to the angular distribution of the data, 
aside from the observational incompleteness. In Appendix [Aj] we explain how we account for the 
incompleteness within the observational data. 

Our only manipulation of the catalogs provided by McBride et al. (2009; in prep) is in the radial 
direction, where we randomly subsample to match the SDSS comoving volume density n(z), to 
better represent the Poisson noise. This subsampling reduces the number of LRGs in DR7-Dim by 
15% and in DR7-Bright by 7%. In Appendix lA.2l we show that this subsampling does not noticeably 
affect either £ or its uncertainties. For more details regarding the radial selection function, please 
refer to that Appendix. 



3.3. Covariance Matrix and x 2 Fitting 

In our analysis we use the standard x 2 fitting algorithm based on a covariance matrix, which is 
constructed from the LasDamas mock realizations. 

As the uncertainties in the £ values for the bins are interdependent, we build a covariance matrix 
Cjj to measure the dependence of the i th bin on the j . We construct Cj,- from the individual 
mock realizations as follows: 

-| ^mocks 

<*-iw^ Eft -*)(*,-<?). < 2 > 

k=l 
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where £ m is the correlation value for the m th bin of the mock mean and is the same for the k th 
mock realization. In all calculations here, iV moc k s = 160. 

We can then estimate \ 2 f° r our observational result £ obs relative to models £ model using: 

N s 

x\o) = E (^ obs - tr^w) Cy 1 (ef s - er dcl w) , ( 3 ) 

where 6 are parameters of the model, and N s is the number of separation bins used. 



4. Results 



Here we present our results and analysis of the redshift space angle- averaged £(s) of the SDSS 
LRGs in two releases: DR3 and DR 7. In jj4.ll we show th at we can reproduce the DR3 LRG 
selection as well as the £(s) results of Eisenstein et al.l (|2005l ). Differences between DR3 and DR 7 
are, therefore, not due to systematic differences in our analysis from that of Eisenstein et al.l ((2005). 
In §4.21 we use mock catalogs to show that the large scale clustering of DR7 LRGs is consistent 
with the concordance ACDM model at the 1.5a level. Furthermore, in §4.31 we investigate a bright 
subsample, which has a strong large-scale signal relative to our ACDM predictions, not seen in the 
other subsamples. In §4.41 we propose a possible observational bias, and explain the robustness of 
the baryonic acoustic feature and large-scale clustering to this effect. In §4.51 we measure the peak 
position of the baryonic acoustic feature. We use this last measurement in §4.61 to determine the 
ratio r s /Dy and compare results to other studies. 



4.1. Reproducing the DR3 £(s) 



To demonstrate consiste ncy with previous studi es, we first compare our analysis of an earlier data 
release, DR3, with that of Eisenstein et al.l (120051 ) . For this analysis we create a DR3 LRG sample 



with the same criteria used by 
this subsection we assume Qmo 



Eisenstein et al 



(|2005l ). and a corresponding random sample. In 



0.3 in calculations. 



In F igureEJwe reproduce the redshift-space £ of the SDSS LRGs first measured by Eisenstein et al. 
(|2005l ). There is good agreement between our results (green diamonds) and theirs (red crosses). 
To investigate the minute remaining differences, we also test using their random catalogs with our 
data set and vice-versa. We conclude that the remain ing subtle variations ar e due to Poisson noise 
in the random catalogs. As previously presented in Eisenstein et al.l (|2005l ). we obtain a narrow 
peak at an apparent separation s > 100/i -1 Mpc (see §4.51 for our analysis of peak position value), 
and a steep slope that flattens at ~ 135 /t —1 Mpc. 
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The dashed lines in Figure [3] display our result for the final data release DR7 (sample DR7- 
Full), which runs through the same galaxy selection algorithm and data analysis as DR3. The 
thick blue dashed line uses the same cosmology as in DR3; for the thin black dashed line we alter 
that cosmology t o fl^ n = 0.25, SIao = 0.75. The binning in all cases is the same chosen by 



Eisenstein et al.l (|2005l ). with the exception that for DR7 we extend the signal a bit further. In 



both DR7 cases the resulting position of the baryonic acoustic feature is in fair agreement ( §4.5p . 
However, we clearly see stronger power on large scales, yielding a wider peak. In the next section we 
examine the significance of this strong large-scale signal relative to our ACDM model predictions. 



4.2. Consistency of SDSS LRGs Clustering with ACDM 

We test the DR7 LRG clustering against a ACDM model by comparing the observed £ to those 
yielded by the LasDamas mock LRG catalogs ( §3.2|) . We run the same analysis as before on each 
subsample of DR7: DR7-Dim and DR7-Bright (see Table 1 and Figure [2]) . We also analyze DR7- 
Full, which has the disadvantage of being flux limited at z > 0.36 but can probe larger-scale modes. 
We find that DR7-Dim is in clear agreement with the model used in the simulations. DR7-Bright 
also has a strong signal at s > 150 /i _1 Mpc. 

We first investigate the quasi-volume limited subsample DR7-Dim showing the SDSS results 
compared to the 160 LasDamas mock realizations. This subsample has a similar £(s) to DR7-Full 
at most radii s < 145/i~ 1 Mpc (see Figure U] for direct comparison). Our results are presented 
in Figure The mock mean is indicated by the green solid line and the SDSS results are the 
diamonds. The light gray shaded region represents the area in which the 68.2% of the realizations 
lie closest to the mean (that is, the la uncertainties). The dark gray area is the same for 95.4% 
(2cr uncertainties). The dotted blue lines indicate the strongest and weakest signal of all mocks at 
each separation (not one realization in particular). 

The observed signal is clearly within the 1— 2a significance level at all scales. To quantify the 
significance of the strong signal at large scales (s > 130/i _1 Mpc) we reproduce this last result to 
s < 400/i~ 1 Mpc (top inset; larger separation binning). Although the expected signal at the largest 
scales is very small relative to the noise, in the top inset we include data as far out as possible 
without significantly reducing pair counts due to edge effects. For a discussion regarding edge 
effects, please refer to Appendix IB.3I 

We also show in Figure [5] a histogram of the mean correlation on large scales (£) s =[i3o 4oo]/i- 1 Mpc 
(bottom inset; red dashed line is SDSS, green dot-dashed is the mock mean), where the brackets 
denote an average over all separation bins in the indicated range. To further quantify the significance 
of the large-scale signal, we measure the \ 2 difference between the observed £ and the mock mean 
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using Equations [2] and [3j In the last equation we limit our bins to those between 130 < s < 
400/i~ 1 Mpc. Using the 10 bins in the top inset (d.o.f= 10) we measure a normalized value of 
X 2 /d.o.f = 0.721. We have tried several definitions for the significance of large-scale power and all 
agree that there is a 1.5a consistency with respect to the ACDM plus HOD model used here. 

Figure [6] shows several hand-picked mock realizations to demonstrate the effects of variance. We 
caution the reader to avoid interpreting these particular realizations as "typical" . Instead, we have 
chosen them to demonstrate the variety of signals we could be reasonably expected from a survey 
the size of SDSS DR7-Dim. Whereas some realizations (e.g., green-dotted and red-solid lines) have 
a similar clustering signal to that observed (symbols), others do not show evidence of the baryonic 
acoustic feature (e.g. the orange dashed and cyan dot-dashed lines). 

Visually examining at all 160 realizations, we find that at least 17 (~ 10%) of the mocks have no 
peak at the expected s ~ 100/i _1 Mpc. We examined by eye all 160 mock DR7-Dim £ results and 
defined a peak-less realization as one with no sign of a peak within 95 < s < 120 /i _1 Mpc. We took 
a liberal approach, so this result should be considered a lower bound; i.e, realizations with subtle 
peaks were counted as having a peak. If we ask how many of the mocks have very clear signs of 
a peak, we find about 75 out of the 160 do. This designation is subjective, of course. It is also 
difficult to compare these numbers with theoretical expectations, e.g. using random Gaussian field 
statistics is not enough as weak peaks in the Gaussian case (corresponding to the linear density 
field) are washed out by nonlinear motions. 

To verify the result of feature-less realizations, we also checked the mock realiza tions in another 



independent mock LRG catalog. In the Horizon- Run mock catalog (jKim et al.ll2009i ). LRG positions 
are determined by identifying physically self bound dark matter sub-halos that are not tidally 
disrupted by larger structures. After adjusting their mock galaxy catalogs to fit the SDSS radial 
function n(z), we ran the same analysis as for LasDamas and found that, of their 32 realizations, 5 
showed no sign of the baryonic acoustic feature, comparable to the result obtained with LasDamas 
mock catalogs. 



4.3. Bright Subsample DR7-Bright 



Figure H] also shows that the correlation function of DR7-Full (dashed line) is stronger than 
DR7-Dim (green diamonds) at scales of 150 < s < 200 h~ 1 Mpc. To help understand the difference 
between these samples we examine another subsample, DR7-Bright. 

The correlation function £ of the brighter volume limited sample, DR7-Bright (blue crosses), 
is stronger on all scales than DR7 -Full and DR7-Dim , which is expected, as bias is known to be 
a function of luminosity (see e.g. IZehavi et al.ll2005l ). For example, Figure [7] shows the relative 
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redshift-space bias ratios, defined as 



—Bright (4) 



showing that DR7-Bright is biased by a factor 1.15 on most scales relative to DR7-Dim, in agree- 
ment with the mock catalogs (by design, of course, at scales r p < 30 h Mpc). 

As discussed in §4.51 Figure H] shows a good agreement in baryonic acoustic peak position among 
the DR7-Full and DR7-Dim. However, the relative strengths of the large-scale signal is worth 
investigating. Figure [8] is similar to Figure [5] for DR7-Bright. The figure and its insets show that 
its signal is significantly stronger than the mock values on scales s > 130 h M.pc. 

The bottom inset of Figure [8]is the histogram of (Os=[i30,500]fe- 1 Mpc f° r the mocks, compared to 
the SDSS value (red vertical dashed line). We perform the same x 2 comparison as in DR7-Dim, 
but this time for scales up to 500/i _1 Mpc (see App endix IB . 3 1 for a justification of the larger scale). 
This test yields x 2 /d.o.f = 2.5 for 11 degrees of freedom (separation bins of top inset), showing an 
unlikely fit with the model used here. 

We note that this result does not rule out all possible ACDM and HOD models, because we have 
compared to only one choice of parameters designed to fit statistics on smaller scales (see §3.21 for 
details). 

We also split DR7-Bright into two subsamples: DR7-Bright-Near (0.16 < z < 0.36; ~ 16,500 
LRGs) and DR7-Bright-Far (0.36 < z < 0.44; ~ 13,800 LRGs). Each produce noisy results for 
£, so we could not draw concrete conclusions regar ding large-scale clus t ering and the baryonic 



acoustic feature. For example, we find, as shown in ICabre fc Gaztanagal (|2009l ). that the distant 
subsample DR7-Bright-Far has no clear sign of a baryonic acoustic feature. In any case, both 
subsamples showed strong clustering on large scales — the signal in the DR7-Bright subsample is 
thus not coming preferentially from high or low redshift. 



4.4. Effects of Systematic Calibration Errors 



Given the large scales and small signals we are probing here, we need to test whether our result s 
are sensitive to systematic errors in the calibrations as a function of angle. lEisenstein et al.1 (120011 ) 
cautions that the number count of LRG targets is sensitive to small variations in color cuts, espe- 
cially in the g and i bands. Since the SDSS targeting catalog is k nown to have calibration er rors at 
the 1% level, the true color cuts applied vary across the survey (jPadmanabhan et al.ll2008l ). Such 
a variation might introduce an artificial clustering signal in our analysis. 



To test the possible effect of these errors on £, we subsample the LasDamas DR7-Bright sample 
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in a spatially varying fashion, and analyze the correlation function of the resulting sample (using 
the original, un-subsampled random catalogs). In detail, the subsampling factor varies from 95- 
100% in sinusoidal waves along the declination direction with a wavelength of 10 deg. This choice 
is motivated by the fact that the targeting catalog is separately calibrated in each stripe, which 
spans 2.5 deg and are (very roughly) parallel to lines of constant declination. 

Comparing the resulting £ with the full mock catalogs, we find insignificant effects for individual 
mock realizations and no difference when averaging over all 160. We tested this both on the full 
DR7-Bright and on a subsample DR7-Bright-Far (0.36 < z < 0.44; see Table 1). Effects are 
significant only when enhancing the subsampling factor amplitude from 5% to 15%, well above 
the expected systematic uncertainties due to calibration. Even at this unrealistic uncertainty, the 
baryonic acoustic feature is noticeable, on average, though with a weaker amplitude. 

Notably, we cannot obtain the strong large-scale signal as observed in Figure [8] using any realistic 
level of calibration uncertainty. 



Here we measure the baryonic acoustic peak position s p . We first construct a model £ 
which is related to the average correlation function of the mock catalogs £ by: 



where (3 represents a bias term and a represents a change in length scale. We minimize x 2 over a 
and (3 (see Equations [2] and [3]) , using the observed £ at scales near the peak. After f3 and a are 
determined we use a spline interpolation on the (very smooth) best-fit model to pinpoint s p . This 
procedure is run on the observed £ and, as explained below, for the individual mock realizations. 

The advantage of using this approach over linear models is that clustering nonlinearities, clus- 
tering bias and redshift distortions are already taken into account in the simulations, as well as the 
angular mask and radial selection. A disadvantage is that we are not self-consistently altering the 
cosmology for each model. However, in §4.6^ when using s p to determine cosmological distances, 
we expect the uncertainties in assumptions we make to be small relative to the data uncertainties. 

£ used above is based on the average £ of 75 mocks with a clear peak (as is the case in the 
observations). Another option we test is using £ constructed from the mean of all 160 realizations. 
The 160 mock mean yields what peak position is expected given a DR7-Dim size sample, where 
the 75 mock model is the same only given a realizaation with a clear peak. As we demonstrate 
below, the choice of mock mean does not significantly vary results. 



4.5. Baryonic Acoustic Peak Position 



model 




(5) 
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In Figure [9] we show the correlation functions of £ 75 constructed from clear peaked realizations 
(dashed bright green) and £ 160 constructed from the full catalog (solid black). As expected, both 
have roughly the same clustering at all scales, except that the latter has a weaker peak. The postion 
of the baryonic acoustic feature is nearly the same (107.88 h^Mpc for the former 107.96 h^Mpc 
for the latter), and they have roughly the same width. The bottom panel shows the residual 
£ 75 — £i6o- As the observation has a clear peak we choose for our final result to use £ 75 , and also 
show that £ 160 yields a similar result. For both we use the covariance matrix constructed from all 
160 realizations. 

Figure [10] diplays our result for DR7-Dim using £ 75 . The upper left panel shows the distribution 
of the best-fit peak position for the mocks and the observations. The vertical red line is the best-fit 
for the observations, the vertical green line is for the mean mock catalog. The upper right panel 
exhibits the distribution of x 2 P er degree of freedom. The vertical red line marks the results from 
the observations and the histogram is that of 75 clear-peaked mocks. The bottom left panel is the 
normalized covariance matrix Cij/^CuCjj. The bottom right panel compares the best-fit model 
frmodei ( D i ue so lid line) to the observations (symbols), and the red arrow points to the fit peak value. 

The mean peak value for the mocks (red line on the main plot) is = 107.88l2'sl h~ 1 Mpc, 

where the la uncertainties are calculated from mock realizations only with clear signs of a peak 
(about 75 out of the 160). The individual mock s p results are shown as the histogram on the upper 
left plot and their normalized x 2 result in the upper right plot. We obtain the uncertainty on the 
observations by scaling these uncertainties for this mock catalog distribution by s p /s™ ocfc . 

Using the observed DR7-Dim sample between 60 < s < 150 /i _1 Mpc we find the peak to be 
at s p = 103.64^241 /i~ 1 Mpc (jj'jjp) wnere these la uncertainties are scaled from our mock catalog 
results. For this fit, we find x 2 /d.o./. = 1.09, where we used d.o.f.= 19 (the number of data points 
used minus two fitting parameters (3, a). Thus, the fit is excellent. By eye it appears poorer because 
it undershoots the data at all points, as one's eye does not account for the strong covariances among 
the bins (see lower left panel of Figure [T0|) . 

If we limit ourselves to the region 60 < s < 135 /i -1 Mpc we find s p = 105.96^2 46 ^ _1 Mpc. Al- 
though both results are consistent, the dependence of s p on fitting range indicates that this sample is 
still limited in its power to constrain s p . We also changed the lower limit to various values between 
[55,75] /i _1 Mpc, constraining the upper bound to 150/i _1 Mpc , and find s p does not vary more 
than 0.6 /i^Mpc. If the lower limit is increased to 80 h^Mpc we obtain s p = 102. Ol^!^ h^Mpc, 
showing a little sensitivity due to not using the full dip feature around 80 /i _1 Mpc. 

We perform the same analysis on 24 jackknife subsamples of the observed sample. They are 
obtained by dividing an RA-Dec map into 24 regions of same number LRGs and excluding one 
region each in turn. For each jackknife subsample we calculate £ and run it through the peak finder 
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algorithm. The uncertainties yielded from jackknifing yield a± = o 26 ^ X Mpc, smaller than the 
sample variance indicated above. 

When using £ 160 we obtain for the data between 60 < s < 150 /i _1 Mpc s p = 104.17j;|o5 fi -1 Mpc, 
very similar as before. The x 2 /d.o.f is 1.13. 

Besides sample variance, another source of uncertainty in s p is due to random-shot noise. In 
Appendix lB.il we show that choices of different random catalogs (ratio of r ~ 15.6 random points 
per data) yield slightly different s p . To reduce this effect we choose for our final results to use 
r ~ 50. 

We also test the effect of dil ution on s p , i.e., choosing a d ifferent cosmology when converting 



redshifts to comoving distances (jPadmanabhan fc White! 12008 ). In all above results we apply a flat 



cosmology with Qmo = 0.25. When using a different value Qmo = 0-30 (but maintaining mocks 
as before) we obtain s p = 102.70 /i _1 Mpc at x 2 /d-o.f.= 0.96. As this result is well within the 
la variance and systematics explained above, we conclude that the choice of cosmology does not 
change our results, but may need to be considered when variance is reduced in future surveys. 

We find that the £ for the DR7-Bright subsample, in the same binning used for DR7-Dim, is quite 
noisy and not useful to measure s p . Although DR7-Bright covers a larger volume than DR7-Dim, 
its density is over four times lower yielding a sample with less than half the number of galaxies, 
severely increasing noise in our measurement. 

We applied the same algorithm (same DR7-Dim covariance matrix) on DR7-Full and DR3 and 
find consistency with these results. A more in-depth analysis would involve building a new co- 
variance matrix for DR7-Full, as it has a larger volume than DR7-Dim. We do not perform this 
analysis, because the LasDamas mocks do not extend to that volume. DR3 has fewer LRGs than 
DR7-Dim resulting in a noisier signal, so we do not find reason to do further analysis. 



4.6. Relating Peak Position s p to Cosmic Distance Dy 



As lBashinskv fc Seljakl (|2004h describe, the peak position s p is closely related to r s , the physical 



comoving sound-wave radius at the end of the bar yon drag epoch. r. q is d etermined to high precision 
in physical units from CMB measurements (e.g., iKomatsu et al.ll2009l ). whereas we determine s p 
in redshift space. Therefore, we can compare the tw o to measure the relationship between redshift 
and distance, as proposed by lEisenstein et al.l (|1999i ). 



There are three important effects we must consider in such a determination. First, the relation- 
ship between the redshift space peak position s p and r s is sensitive to non-linear cl ustering, redshift 



distortions and galaxy bias, the dominant effect being a broadening of the peak (lEisenstein et al 
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2007bl : ICrocce & Scoccimarrol [2008l: ISmith et alJbood : kngulo et aDhood : ISeo et alJ[20oi : lEisenstein et~al 



2007a 



Kim et al 



20091 ) and a subdominant shift towards small scales that is below current statis- 



tical uncertainties. 

Second, the measured s p is necessarily associated with a fiducial cosmology used to interpret the 
redshifts and angular positions in terms of comoving distances. The fiducial cosmology we assume 
is the same concordance flat cosmology as the mocks used ([Tl^, ^l^, h^ id ] = [0.25,0.04,0.7]). 
Technically this means that we use cosmological assumptions in two steps of our algorithm: selection 
of LRGs by magnitude cuts, and converting observed redshifts to comoving distances. For more 
details please refer to Appendix IB.3I 

Third, we need to specify what "distance" we seek to measure, since the definition of cosmological 
distance within the context of general relativity is not unique. The relevant distance measures in 
this context are the angular diameter distance Da(z) and the Hubble constant H{z) at redshift z 
( Hoge]|l999l ). The former would be ideally constrained by measuring a p in a thin shell at radius 
z, and H(z) by measuring the line-of-sight clustering (jMatsubaral 120041 ). These measurements 
individually are hard to perform with the c urrent data set (for attempts in DR6 please refer to 
Okumura et al.ll2008l . iGaztanaga et al.ll2008l ). Instead, we will constrain a combination of the two 
following the standard approach in the literature (see Equation [7] below). 

To account for the above effects, the standard interpretation of the acoustic peak in the correlation 



funct ion is as follows (with an analogous argument made for the power spectrum; iPercival et al 
20071 ). First we assume the proportionality: 



r s = 7V 



(6) 



Mock catalogs can be used to determine 7, as long as the assumed cosmology is not far from the 
true one. We have done so here for the LasDamas mock catalogs, whose cosmological parameters 
are well-motivated from the CMB and other constraint s. We calculate r{ ld (zd) = 159.75 Mpc for 



the mocks using Equation 1 from iBlake &; Glazebrook! (120031 ) , and calc ulate the sound speed c . 9 
and the end of the drag epoch Zd = 1012.13 using Equations 4 and 5 from lEisenstein fe Hul ([1998). 
We insert 02.7 = 2.725/2.7 as the temperature ratio of the CMB in their Equation 5. Using s mock 
from §4.51 and h = 0.7 we obtain 7 = 1.037. 



Second, we construct the "distance" quantity: 



D v = [(l + z) 2 D 2 A cz/H(z)] 



(7) 



This quantity is designed in such a way that the ratio Dy((z)) /s p ((z)) is approximately indepen- 
dent of the choice of the fidu cial cosmological model (jEisenstein et al.l 120051 ; IPercival et al.l 120071 : 
Padmanabhan k White] I2OO8I ). 
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Third, we can rearrange these relationships to obtain: 

D v ((z}) _ D v ((z),M) 



(8) 



where s p ((z}) is understood to be the inferred s p from §4. 5 1 given the fiducial cosmology. Measuring 
s p thus yields the ratio of the distance at redshift (z) to the acoustic scale. Given the acoustic 
scale r s in Mpc from the CMB, we then can determine the distance Dy((z)) to that redshift. As 
Equation [7] shows, this distance measurement constrains a combination of the angular diameter 
distance Da and the Hubble constant H{{z)). 



Given the results in g3J we find r s /D v = 0.1394± 0.0049 at (z) = 0.278, with la uncertainties 
of around 3.5%. Both our values and uncertain ty bars are in excellen t agreement with the analysis 
of the power spectrum of a similar sample by iPercival et al.l (12009). Combining our re sult with 
the sound-horizon r s = 153.3Mpc obtained from WMAP5 CMB (jKomatsu et al.1 12009| ) we find 
ZV(0.278) = 1099±38 Mpc. 



Figu rellllpresents our result compared with those obtained bv lPercival et al.l (|2009i ). lEisenstein et al 
(|2005l ). as well as predictions of flat ACDM cosmologies. Our data point is the black diamond where 
we choose to use the mean s ample reds hift (z) = 0.27 8 . The red and orange crosses are the values 
published in iPercival et al.1 (j200g|) and IPercival et al.1 J2003): r s /D v (z = 0.2) = 0.1981 ± 0.0071, 
r s /D v (0.275) = 0.1390 ±0 0037, rjD v (0 . 35) = 0.1094 ±0.0040 where the small red crosses results 



are indicated in Table 3 of 



Percival et al.1 (12003 ). The cyan cross is r s /D v (0.35) = 0.1097 ± 0.0039 



obtained by Ifieid et al.1 (|2009l ) when analyzing the P(fc) of the reconstruct ed halo density field of 
DR7 LRGs. The blue square is the result obtained by lEisenstein et al.l (|2005l ) (D v (0.35) = 1370±64 



Mpc, if we use the CMB r s value as before). We also add the result obtained by ISanchez et al 



([2009J) (purple triangle) who analyze the DR7 LRG £. Using only LRG clustering they obtain 
Dy (0.35) = 1230 db 220 Mpc. They show that when combining £ with CMB measurements they 
obtain a tighter constraint Dy (0.35) = 1300 ± 31 Mpc. We plot the latter where we use r s from 
CMB for plotting purposes. 



Our result is in perfect agreement with that obtained by IPercival et al.l (|2009l ) at z = 0.275. 
Please keep in mind that both results, as well as those obtained by other above studies that analyze 
the SDSS LRGs, are not independent, as we use roughly the same data set. We are encourag ed, 
nevertheless, that our consistency check yields a result in agreement with IPercival et al.l (|2009l ). 



We comment that if we use the median redshift 2 me d = 0.287 rather than the mean (z) we obtain 
r s /D v (z = 0.287) = 0.1354 ± 0.0047 and D v (z med ) = 1132 ± 40 Mpc. 



When comparing these results to cosmological predictions, we assume a flat ACDM, and fix lom = 
tt M0 h 2 = 0.1358. This const raint is motivated by its low (2.7%) uncertainty in the WMAP5 CMB 
temperature measurements (jKomatsu et al.ll2009l ). Dy depends both on Q-mo and h independently, 
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thus its values vary. Meanwhile, r s depends on u>m (and fl^h 2 ) so it is kept constant. The la results 
of r s /Dy indicate that, constraining wm from CMB, the preferred region of the matter density lies 
in the range Qmo = [0.25,0.31], and h = [0.66,0.73] in agreement with CMB and others. This is 
consistent for both choices of redshift (mean or median). If we would plot median redshift result 
(z = 0.287) in Figure [11] it would appear along the f^MO = 0.28 line. We defer a full analysis of the 
cosmological implications. 

Our £ results for DR7 LRGs as well as the covariance matrix may be be obtained on the World 
Wide WebQ 



5. Discussion 



The nature of the baryonic acousti c peak and larger scales in have also been discussed in 
previous studies. ICabre Gaztahagal (|2009l ) used DR6 (77, 000 LRGs) and found a similar level of 
clustering to ours and examined various possible data analysis systematic effects that might cause 
the strong signal (the positive £ at scales larger than 150 /t - 1 Mpc). We followed sim ilar steps with 
the addition that we revis i t DR 3 to reproduce results from lEisenstein et al.1 (|2005l ). In agreement 
with [Cabre fc Gaztahagal (|2009l ). we show in our Appendix [B] that data analysis systematics have 
no significant effects on these results. 

The main difference between DR3 and DR7 is the sky coverage. DR7 covers over twice as much 
sky and, as opposed to the DR3, is continuous. The latter should not be an important issue due 
to the fact that the random points used in the £ estimator take into account boundary effects and 
holes within the RA-Dec plane. The mock catalog tests we conducted suggest that sample variance 
is a possible explanation for the difference between the large-scale signals of the two data sets. 



Martinez et al.l (J2008|) also present a wide baryonic acoustic feature and large-scale clustering in 



the £ for SDSS LRGs (D R7). It is worth n o ting t hat their definition of M g is slightly different than 
the one used here and in lEisenstein et al.l (|2005h . Also, as described in Appendix lA.il we correct 
for data angular incompleteness where they did not. Nevertheless, our DR7-Full £ results are in 
fair agreement with their DR7-LRG. 

We show that sample variance affects not only the shape of the signal at large scales (hence helping 
explain the broadness of the baryonic acoustic feature) , but also the probability of detecting a peak: 
we found that at least 10% of the mock realizations lack a baryonic acoustic signal. Nevertheless, 
we show, in agreement with other studies mentioned here, that the SDSS LRG sample contains 



4 http://cosmo. nyu.edu/~eak306/BAF. html 
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a baryonic acoustic feature which is stable within most redshift and M g cuts, as well as possible 
observational bias. Larger surveys are underway to better measure this new holy grail for cosmic 
distances. For example, the Baryon Oscillation Spectroscopic Survey (B OSS) is estimated to map 
1.5 million LRGs in a much larger volume than the DR7, up to z ~ 0.7 (jSchleeel et alibood ). 

We measure the observed peak position s p to an accuracy of ~ 3.5% based on a model constructed 
from our mock catalogs results. The main source of this uncertainty is due to sample variance, of 
the DR7-Dim subsample used. Fitting data to a redshift space, non-linear model, we also explain 
sensitivity of determining the peak position to the range of data points used, as well as shot-noise. 
These systematics are shown to be less than la of the sample variance, but should be considered 
when the latter is reduced. We use our measur ement of s p to determ ine the ratio r s /Dy ( §4.6p . Our 
result agrees very well with that obtained by lPercival et al.1 (|2009l ) who analyze the oscillations in 
the power spectrum and quote results at a very similar redshift. Note that we use shape information 
in the correlation function and do not marginalize over cosmological parameters, but rather tested 
consistency of one fiducial cosmology. However, our result does not have tighter constraints than 
that obtained by their study. Thi s may be due to our m ock catalog error bars being larger than the 
lognormal approximatio n used inlPercival et al.l (|2009l ) or a difference in the range of scales used, 
among other things (see lSanchez et al.ll2008l for a comparison of the relative performance of £ and 
P(/c) estimates). 



Regarding claims of the absence of anti-correlations at the largest scales (jLabini et al 
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DR7), we point out that the mock realizations show a large variety of crossover values r c from 
positive to negative correlations. In Figure [12] we show all 160 DR7-Dim mocks. Their crossover 
points are indicated by short green lines, and that of the mean (white line) by the orange line at 
~ 140 /i _1 Mpc. We find 4% of the mocks do not crossover before 200 /i Mpc, but this value should 
not be taken too seriously as it increases with wider binning, which causes less noisy results and 
less crossovers. A similar result was showed to us by E. Gaztahaga for DR6 mock catalogs. The 
r c values (or s c as we measure in redshift space) are summarized in the histogram in the inset, 
showing a wide variety of crossovers between [120, 160] /i _1 Mpc and even some around 80/i _1 Mpc. 
We comment that the crossovers are defined as the first time the £ crosses through the zero value, 
and we do not account for returns to positive values. Though having different bias in clustering 
in respect to matter, galaxies should have the same crossover point between correlation and anti- 
correlation. We conclude that sample uncertainties still dominate our ability to perform such a test 
for determining r c . 
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6. Summary and Conclusions 



Data sets later than DR3 yie ld a broader baryonic acoustic peak and stronger large-scale clus- 
tering signal than measured by lEisenstein et al.l (|2005l ). In this paper we have demonstrated that: 



1. 



2. 



3. 



5. 



Differences between DR3 and DR7 results are not due to known systematic uncertainties in 
data analys is. Applying the s ame methods in the DR7 analysis, we reproduce the same DR3 
data set as lEisenstein et al.l (|2005l ) and match the same resulting £. 



Large-scale clustering of DR7 results are in agreement with the ACDM+HOD model used 
here (flat Qmo = 0.25). The average £ at scales 130 < s < 400 ft, -1 Mpc is within 1.5<7 variation 
of mock LRG catalogs. 

The detected baryonic acou stic peak position in D R7 seems stable within most investigated 
subsamples and agrees with lEisenstein et al.l (|2005l ). When analyzing £ results between 60 < 
s < 150/i~ 1 Mpc we find the peak position to be at s p = 103. 6± 3.6 /i -1 Mpc. This result is 
sensitive up to la to the upper boundary of range chosen for analysis, random-shot noise and 
binning. 

Using this last result we measure the ratio between the sound-horizon at the end of the baryon 
drag epoch r s to D v (Equ ation|7D at rjDyjz = 0.278) = 0.1394± 0.0049 Mpc. Our result is in 
excellent agreement with iPercival et al.l (|2009i ). who analyzes the power spectrum of roughly 
the same sample, and utilizing a different approach of analysis. Inserting r s obtained from 
WMAP5 we calculate Dy(0.278) = 1099± 38 ft, -1 Mpc. Comparison with other determinations 
in the literature are also in very good agreement (see Figure [IT]) . 

We find a lower bound of 10% of mock realizations that do not show evidence of a sign of a 
baryonic acoustic feature. However, given a mock realization with a clear peak, we conclude 
that we can measure its peak position value s p within lcr Sp ~ 3.5 /i _1 Mpc. 

A bright volume-limited subsample of DR7, DR7-Bright, shows a significantly stronger large- 
scale-signal than predicted by our mock catalogs, which is not explained by potential system- 
atics. 



The differences between DR3 and DR7 in the £ may be explained in two ways: signal variance 
or observational systematics. Our analysis of the LasDamas mock catalogs show that the signal 
is still dominated by noise, which yields a variety of large-scale signals, wide and narrow baryonic 
acoustic peaks, as well as some featureless. 



In £ j4.4l we test a method of how the sen sitivity of LRG selection might affect the correlation 
function. Although lEisenstein et al.l (|200ll ) cautions that the number count of LRG targets is 
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sensitive to small variations in color cuts, our te st shows robustne ss of the large-scale clustering 



and of the baryonic acoustic feature in particular. iHogg et al.l (|2005l ) also demonstrates consistency 



of the survey calibration in its different patches, by examining DR3 SDSS LRG number counts in 
different regions of the sky. 

Our analysis of the apparent strong large-scale signal of the DR7-Bright subsample compared 
to the model used does not have the power to rule out clustering predicted by ACDM, as it is 
tested against one cosmology, and one HOD model. As explained, the HOD parameters, were fit 
using only small-scale clustering, and might not be the best choice on large scales. However, it is 
reassuring that even in this case, the discrepancies of the data compared to our mocks are not much 
larger than at the 2<r-level (in order to rule out the cosmology used, one would have to marginilize 
over all possible HOD fits, which we have not done). Furthermore, a large-scale enhancement of 
the two-point correlation function for a fixed cosmology may be obtained in models o f primordial 
non-Gaussianity with parameter /nl > 0, see e.g. Fig. 10 in Desjacques et al. (j2009l ). It will b 
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interesting to follow up these issues with future surveys that will tighten uncertainties. 



In summary, we show in this study that the SDSS LRG DR7 sample is consistent with ACDM, 
and the baryonic acoustic feature is stable within its various datasets and most subsamples. 
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A. Selection Functions 



Here we explain the angular and radial selection functions of the SDSS LRG sample, as well as the 
mock catalo gs. When sele c ting a nd weighting LRGs in DR3 and DR7, we followed the procedures 
described in IZehavi et al.l (|2005l ) for the most part, and explain here a few differences from that 



study, none of which affect the end results. 



A.l. Angular Mask 



The SDSS LRGs cover roughly 20% of the sky, mostly in the Northern Galactic Cap, and partially 
in the Southern (Figure H]). After an image of the sky is obtained in the u,g,r,i,z bands, objects 
are targeted for a spectroscopic observation to mea sure their redshifts. Here we refer to targeted 
quasars, Main galaxies and LRGs as targeted objects. IZehavi et al.l (|2005l ). in their appendix, discuss 
re asons that some targe ted objects fail to receive spectra, and their method (which is also used 



m 



Eisenstein et al.ll2005l ) of correcting for this small incompleteness. Please refer to their study 
for definitions of "sector", "sector completeness", "fiber-collisions" and the polygon method with 
which the complicated angular geometry is expressed in DR3 and later releases. 



We choose a slightly different definition for sector completeness than IZehavi et al.l ((2005). We 
define it as the number of targeted objects (Quasars+LRG+Main) that have obtained spectra (af- 
ter fiber-collision corrections) divided by the total number of targeted objects. By "fiber-collision 
corrections" in the number of targeted objects which obtained spectra, we mean that before calcu- 
lating sector completeness, we up-weight objects with spectra which are within 55" of objects that 
did not. For example, if we find a group of 4 objects in 55" proximity and three of them get spectra, 
all three are fiber-collision weighted by 4/3. This up-weighting plays two roles in our algorithm: 
defining sector completeness and while counting galaxy pairs. 

We mentioned above our definition for sector completeness. If, for example, the only objects which 
did not get spectra within a sector are due to fiber-collisions, the sector is considered fully complete. 
A rare peculiarity occurs due to some 55" neighbors at different sides of a border between sectors 
in which one (or more) obtain spectra and another does not. In these cases some sectors might 
have a completeness larger than unity, meaning they obtain a partial completeness fraction from 
the neighboring sector. In summary, the average completeness of all sectors is 98% for DR7. If we 
define Main+LRG as 'targeted objects' (excluding quasars) the completeness yields the same, and if 
we define targeted objects as LRGs only we obtain 96%. We find that these differe nt definitions for 



compl eteness result in subtle mismatches of the DR3 LRGs compared sample to lEisenstein et al 
(|2005l ). all of which less than 0.7% of the ~ 47, 000 galaxies, a nd ne gligible effects on £ compared 



to shot-noise of random points used. As in lEisenstein et al.l (|2005l ) we limit ourselves to sectors 
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with > 60% completeness. This results in 29 sectors (that have a non-zero completeness) with a 
total area of 13 square degrees (0.16% of targeted sky) and a total of 364 targeted LRGs. As in 



Eisenstein et al.l (|2005l ) we used the completeness as a probability to exclude random points from 
each sector. They also up-weight both data and random points in each sector by the reciprocal of 
the completeness value. With the high completeness of the survey we do not expect differences in 
the resulting £ to both methods. 

To account for fiber-collisions while pair-counting, we use a slightly different population than 
before. LRGs get up-weighted if they are 55" neighbors of a targeted LRG (not quasar or Main 
galaxy) which did not get spectra. This effectively increases our sample LRG number by ~ 2.2% for 
DR7-Full and ~ 1.8% in DR3, which is important to take into account when calculating normalized 
number of pairs DD, DR and RR. 

The LasDamas mock catalogs match the survey geometry as described by the polygon description 
in the NYU-VAGC. Since the "gamma" release mocks do not model fiber collisions nor missing 
sectors, completeness is defined to be lOOFor more details please refer to McBride et. al (2009; in 
prep). 



A. 2. Radial Selection Function 

The LRG sample used here is the largest quasi-volume-limited spectroscopic sample of its kind 
today. That said, the radial selection function, n(z) is not constant (meaning not volume limited), 
as one would expect in a homogeneous universe (up to Poisson noise and radial clustering). Instead 
it is quasi-volume-limited up to z ~ 0.36 and flux limited thereafter. In what follows we show that 
the features in the n{z) of DR7-Dim do not affect our results. On a more technical note we discuss 
distributing random points in the radial direction. 

We test the LasDamas mock catalogs for the features in the observed radial selection function. 
The original mocks do not trace the n(z) of the corresponding SDSS redshift and luminosity cuts. 
The top left inset in Figure [TBI shows this difference in shape for the DR7-Dim sample, as LasDamas 
provides an n(z) with a slight negative slope (cyan; average over 160 realizations). This slope in the 
LasDamas mocks is a consequence of applying fixed HOD parameters to a light-cone halo catalog, 
as it neglects the evolution of the dark matter halo mass (i.e. lower halo masses as higher redshift 
result in fewer artificial galaxies and hence the negative slope). The survey result (thick green 
histogram) is the same as in Figure [2j Two features noticeable are the negative slope between 
0.16 < z < 0.28, and the positive slope to the peak at z ~ 0.34. We show that these features 
have a negligible effect on £ and the r.m.s <7g compared to the distribution given by the original 
mocks, which is closer to volume limited. To validate this claim, we exclude mock LRGs such that 
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they match the SDSS radial selection function. For the DR7-Dim this meant excluding 15% of the 
mock galaxies (n(z) average over 160 realization shown as black histogram) and for DR7-Bright it 
meant excluding 7%, yielding, on average, a similar number count and volume density as in Table 
1. In the main plot of Figure [131 we compare results before and after exclusion of the mock galaxies, 
and find that the mean £ of 160 realizations agrees on all scales. The right inset shows that the 
diagonal terms of the covariance matrix crt = y/Ca is slightly higher for the galaxy-excluded sample 
as expected from slightly larger Poisson noise. We conclude that the shape of observed n(z) for the 
DR7-Dim sample does not appear to affect the results obtained in this study. 

We would also like to address the issue of distributing redshifts to random catalogs, a non- 
trivial step in using random-point based £ estimators, when dealing with non-volume-limited data. 
Two popular methods for distributing redshifts (or comoving distances) to random points are 
redistributing the actual data redshifts randomly, and assigning random distances, so that the 
overall n(z) shape matches that of the data (i.e, using the data n{z) as a probability function). We 
test both random point distribution with and without radial weighting. 

We show results of both methods of distributing random points in Figure[T4l To clarify differences, 
in the top panel we plot s ■ £ and the ratio between all cases to Data-Redshift in the bottom 
panel. For convenience we define the case of distributing data redshifts to random points as "Data- 
Redshift" (diamonds), and the case of distributing random points to match the data n{z) as "Data- 
n(z)" (crosses). The data used in this analysis are the original DR7-Dim mocks averaged over 8 
realizations. We compare results without radial weighting (black) as well as with (bright green, 
shifted by 2 /i _1 Mpc for visual clarity, shift not calculated within s • £). The radial weighting when 
pair counting is explained bellow. The figure shows that "Data-Redshift" is noticeably weaker 
than "Data-n(z)" until s < 110/t _1 Mpc. This is a clear display of the diminishing of the radial 
clustering modes. Although this effect is very small relative to our current la measurements, it 
will be important once these are reduced. On the larger scales the differences are diminished. The 
red crosses emphasize the importance of weighting the random points in the same fashion as the 
data. Not doing so yields spurious clustering. 

In conclusion, we choose to radially distribute random points to match the same n{z) shape of 
the data, using the same weight, and not distributing the data redshifts to the random points, as 
to preserve the radial modes. 

After deciding to use the survey n(z) to analyze the mock catalogs we have two options of use 
of random points: (a) compare all data points to one random set, which has the mean n(z) of the 
data; or (b) imitate the observation for each realization by comparing each to a random catalog 
with a tailor-made random catalog. By "tailor-made" we refer to the adjustment of the redshifts 
z, so that each random catalog yields a similar n(z) to that of the mock data. We applied this 
difference when analyzing DR7-Dim and DR7-Bright and find no difference in the £ mean results. 
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As for the r.m.s we find very small differences in both. On scales up to s < 200/i~ 1 Mpc the 
DR7-Dim shows fluctuations smaller than 4% where DR7-Bright shows fluctuations below 8%, with 
no particular preference for either method. This might be a result of the fact that the transverse 
modes dominate the radial ones. For completeness we note that in our mock results we use the 
former method, i.e, we apply one random catalog for all the mock data, where the ratio of random 
to data is r ~ f0.5. 

To weight the data approximately according to volume, we need to account for the radial selection 
function n(z) (see Ap pendix lA.2l and Figure[2]). To do so, we apply the standard weighting technique 
(|Feldman et al.lll994h . 



For the SDSS LRGs we calculate n(z) in bins of Az = 0.015 and use a spline interpolation to 
calculate the radial selection value of each LRG and random point. Each point is then assigned 



a radi al weight of 1/(1 + n(z) ■ P w ) where we use P w = 4 • 10 4 /i 3 Mpc , as in Eisenstein et al 



(|2005l ). The weights are applied to both the d ata and the random catal ogs. The choice for Az 
does not affect the measured £ (as also shown in lCabre &: Gaztanagall2009i ). We also examined for 



differe nces between choosing the observed n(z) for weighting and the model used in lZehavi et al 
(2005) and found no difference. 



B. Systematics of £(s) 

Here we address possible data analysis effects on our results. We focus on weighting schemes 
both in the radial and in the angular masks. In general, we find that the position and width of 
the baryonic acoustic feature does not change much, though we do find small effects in s p due to 
Poisson shot-noise. 



B.l. Random Shot Noise 

When comparing between different £ results obtained by various systematics, it is important to 
differentiate biases from variances obtained by random shot-noise. 

Applying five different random catalogs to the observed DR7-Dim , at a ratio of ~ 15.6 random 
points for every data point, and assigning radial weighting (similar to that performed in the analysis) 
we show in Figure [15] that the random shot-noise is minimal on most scales of concern. The top 
panel shows the £ for five different random catalogs against the same observed sample (black-dotted, 
green-dashed, red-dot-short-dashed, orange-long-dashed, blue-triple-dotted-dashed). To facilitate 
noticing differences the bottom panel shows £ ratios in respect to the first random catalog. For the 
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chosen binning, the baryonic acoustic region seems to yield less than a 10% difference. We reran our 
s p algorithm on all random catalogs and find the peak positions [102.3, 102.9, 103.0, 105.3] /i _1 Mpc. 
This shows that, although the overall shape seems very consistent when using the different random 
catalogs, the random-shot noise has a small effect on pinpointing peak position. This is currently 
smaller than the survey la, but should be considered when statistical uncertainties improve. The 
fitting normalized x 2 range for all five catalogs is x 2 /d.o.f = [1.08 — 1.38]. To reduce this effect on 
our measurement in §4.51 we used a ratio of number of random to data of ~ 50. The peak position 
for is marked by the top arrow at s p = 103.6 /i _1 Mpc. 

In the region between 50 < s < 90 /i _1 Mpc we see up to 10% differences, which is expected in a 
sharp logarithmic slope. On smaller scales s < 50/i _1 Mpc differences yield up to 3% difference in 
amplitude. 

B.2. Effects of Weighting 

In order to optimally measure the correlation function and to account for the fiber collision effects, 
we apply weighting algorithms. All differences due to choices about how we weight turn out to be 
much smaller than current la variances. Angular weighting schemes (fiber-collision correction and 
sector completeness) are explained in Appendix IA.1I Here we explain our algorithm for the radial 
weighting and show results for both. 

Figure [16] shows the effects of various angular and radial weighting schemes on the large-scale £ 
of DR7-Full (for clarity we present s ■ £ in the top panel). The weighting schemes compared are: 

1. No weighting at all (black) 

2. Radial weighting only (cyan; shifted by 1.75h~ Mpc for clarity) 

3. Radial weighting + Fiber-collision (red; shifted by 3.5/i _1 Mpc ) 

As mentioned in Appendix IA.H sector completeness is taken into account in the distribution of 
the random points. The bottom panel shows the ratio of each of the above options £j over the 
non- weighted £o- To avoid possible random shot noise (see Appendix §B.lj) we use the same set of 
random points in each case. The weight is modified for each option for the data and random as 
indicated in the plot. The fiber-collision weight for the random set is always w = 1. 

We clearly see from the three above options the effect of radial weighting on scales s < 170 /i _1 Mpc. 
We can grossly divide this range into a smaller one of s < 115 ft Mpc in which the radial weight 
adds some power to the signal, and a larger scale region in which the signal is reduced. The fiber- 
collision correction weights ("Fiber-Coll") are important when dealing with scales s < 40ft~ 1 Mpc 
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to the few percent level, but not at ranges discussed in this work. Beyond 170 h 1 Mpc the effects of 



the w eighting schemes used here are minimal. Most importantly, as shown in lCabre fc Gaztahaga 



(|2009l ). the apparent strong £ signal at large-scales, and the baryonic acoustic feature position are 



consistent among the various weighting methods. 

These weighting effects are not substantial in Figure EJ because the n(z) used there (full DR7- 
Dim LasDamas catalog without dilution to match SDSS n(z)) does not have the same complex 
features seen in DR7-Full (the mocks used in that figure are instead close to volume-limited). 



B.3. Various Systematics 



We have checked a few other possible systematics collectively described here. 

The SDSS DR7 sample is mostly contiguous, with only 9.8% of the surveyed area not contained 
in the main part of the Northern Galactic Cap sample (Figure [1]). We examined three choices 
for which footprint to use: the full survey (100% of LRG sample), the Northern Galactic Cap 
only (90.9%), and the Northern Galactic Cap without the small "island" (90.2%). We found no 
significant difference in the resulting DR7 £ for these samples. 

In order to calculate the separation in comoving space between galaxy pairs, we must use the 
observed redshifts and assume a certain fiducial cosmology. For most of this study we assumed 
a flat ACDM model with Oa/o = 0.25. When analyzing d if ferenc es between DR3 and DR7-Full 
we also tested the cosmology assumed by lEisenstein et al.l (|2005l ) Qmo = 0.3. We also examine 
each cosmology on each subsample and found that the cosmology does not affect the resulting £ or 
the measured s p significantly relative to our other uncertainties. For a direct comparison of both 
cosmologies in DR7 see Figure [3l 

We also tested how the choice of cosmology affects the selection of LRGs, due to the fact that 
M g is sensitive to cosmology. Within DR7-Dim (0.16 < z < 0.36, -23.2 < M g < -21.2) we 
count 61,899 LRGs when using Qmo = 0.25 and 61,102 when £Imo = 0.30, a 1.3% difference. 
When probing the full sample, DR7-Full, we find an agreement in number of LRG selected in the 
two cosmologies to better than 1%. This implies that for large range of redshift an approximate 
cosmology should not change the number count very much. 



As mentioned in jSl the £ calculation requires the choice of a particul a r estimator. We tested 



the 



Landv <fe Sza lav ( 1993]) estim ator against those proposed by iHamiltonl (119931 ) . iDavis fc Peebles 



(jl983h . IPeebles & Hauserl ( 11974 ). Figures [T71 and [T8l show our £ and o"£ results, when using DR7 



Dim. In Figure [T71 the mock mean (over 160 realizations) are the lines, while the SPSS result are 
the symbols. The different colors indicate the different estimators used, where lLandv Szalav 
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(|l993l ) (black solid, LS93 hereon) and iHamiltonl ()1993l ) (green triple-dot-dashed, HAM93) are not 
distinguishable by eye for the most part. The inset shows the noise-to-signal ratio. Bear in mind 
that the spikes around 140 /i _1 Mpc simply result from the signal crossing zero around that scale. 
In Figure [18] the notation is the same when plotting cr^(s) for the different estimators. The inset 
in this plot shows the ratio of each estimator relative to that of LS93: 

We find the following: 



1. Mean value £: Averaging over all 160 mocks we see that LS93 and HAM93 agree on all 
scales. The inset of Figure [18] shows that Peebles & Ha userl (1 1974!) (PH74) deviates from the 
latter by 10% at scales of ~ 75 /»~ 1 Mpc and lDavis & Peeblesl |l98J) (DP83) does the same at 
85 h~ 1 Mpc. We also note that LS93 and HAM93 asymptote faster to zero than the others. 

2. r.m.s The r.m.s of the signal varies among the estimators on various scales. PH74 yields 
the same as LS93 up to scales of ~ 40 /i _1 Mpc before it starts strongly branching off. This 
is clearly seen in the main plot of Figure [18] and indicated in the other plots by the strong 
variation of the observed result. Both HAM93 and DP83 have a larger variance than LS93 
at scales smaller than 10/i _1 Mpc. HAM93 later matches the LS93 on all larger scales very 
well, where DP83 breaks off at ~ 60/i _1 Mpc. 



We conclude that the lLandv Szalavl (|1993l ) and IHamiltonl (|1993l ) estimators agree in signal for 
each realization on large scales, and perform much better than the other two as their variance is 
smaller, and converge much more quickly to zero. In particular, the other two esti mators yie l d very 
large uncertainties on large scales for individual r ealizations. We also find that IHamiltonl (|1993l ) 
does not perform as well as lLandy Sz Szalavl ( 19931 ) on smaller scales s < 10/i _1 Mpc. Our analysis 
on DR7-Dim agrees with the other samples DR7-Bright, DR7-Bright-Near, and DR7-Bright-Far. 
Here we use ratio of rando m points to data of r ~ 15.6 for the SDSS and r ~ 10 for mocks. 
Kerscher et al.l ([2000) shows lLandy &; Szalavl (|l993l ) to be the estimator with the best preformance 
relative to the true £ in periodic box measurements. 



To test the importance of binning, we use the lLandy fc Szalavl (j 19931 ) on all 160 mock DR7- 
Dim realizations and find a strong dependence of the variance on the bin size, as expected. We 
test various bin widths As between As = [0.55, 10] h~ 1 M.pc noting that variance changes strongly, 
where the lowest As yields the noisiest ag, due to the higher shot-noise in each bin. For most of 
our analysis, including the baryonic acoustic feature, we used As = 4.44 h~ Mpc. We find that 
peak position s p is not altered by more than la for any choice of bin size. 

We also check for boundary limits in our survey, to get an idea of at what point our £ estimates 
are encountering the edges of the survey. We show in Figure [18] a negative slope for all the 
way to s < 400 h^Mpc. A region we definitely want to avoid in analysis is one in which has a 
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positive slope, which indicates an upper limit to the effective scaling given the survey volume. For 
this reason in the top panel of Figure [19] we continue this plot for DR7-Dim (black solid) and DR7- 
Bright-Near (green triple-dotted-dashed) to 800/i _1 Mpc . These subsamples (both limited to the 
range 0.16 < z < 0.36) have a declining to 500/i _1 Mpc, and a slightly positive slope thereafter. 
In the second plot from the top we show the normalized data-data pairs DD of DR7-Dim and 
DR7-Bright-Near (same notation), as well as the random-random pairs RR count of DR7-Dim 
(blue dot-dashed). The dip feature at ~ 500 /i~ 1 Mpc mentioned before appears here as the peak at 
that scale. In the third plot from top we differentiate the previous results by s to better see where 
the number of pairs stop growing with radius. We see a clear crossover between 500 - 600 h^Mpc. 

The bottom two plots of Figure [19] can be considered "sanity checks", as we verify basic statistics. 
Given a periodic box we expect the number of random points N around a given a point at radius 
s in a spherical shell of width ds to be: N = Aits 2 dsn where n is the mean density. RR(s), is 
expected to go as ~ N(s) ■ Nr/2, where Nr is the number of random points, and 2 takes into 
account double counting. Hence dln(RR) / d\n(s) should yield 2. Deviations from this result are 
due to boundary effects. In the fourth from top plot we present dln(x) / dln(s) , where x is RR 
(blue dot-dashed) and DD (solid black). The important result from this plot is the deviation of 
dln(RR)/dln(s) from 2 at large-scales. A 5% difference is noticed at 50/i~ 1 Mpc. At the baryonic 
acoustic feature scale the deviation from 2 is ~ 10%, at 200 /i _1 Mpc by ~ 23% and at 400 /i -1 Mpc 
by 60%. Although this panel indicates our volume is far from ideal for analysis of 400 /i _1 Mpc, the 
LS93 estimator appears to be valid as it corrects for these boundary effects by comparing number 
of data pairs to number of expected random points given boundary conditions. We preformed the 
same test for random points within a volume of DR7-Bright (red dot-dashed) where we extend 
measurement to 500 ft, _1 Mpc. 

In the bottom plot we determine how many cubes of length s would fit into the volume con- 
tained within DR7-Dim (black solid) and DR7-Bright (red three-dot-dashed). Our choice of 
s = 400/i~ 1 Mpc yields 10.31 for the former and 18.65 for the latter. For DR7-Bright up to 
s = 500 /i _1 Mpc we count 9.55 cubes. As for the peak position at s = 103 /i -1 Mpc , we count 604 
cubes for the DR7-Dim volume and 1093 for a DR7-Bright volume. 

All the plots in Figure [19] indicate that our choice of s < 400/i~ 1 Mpc is a fair one within the 
given sample, and does not depend too much on edge effects. 

We also test for uncertainties of the observed DR7-Dim and DR7-Bright against those pre- 



dicted by Gaussianity plus shot noise (jBernsteinl 11994 ) at large scales, 



J d*k [P{k)+n- l ] 2 j {ks)\ (Bl) 



where V is the volume of the survey (see Table 1), P{k) the galaxy power spectrum, and jo the 
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spherical Bessel function of zeroth order. We first test this formula against the variance derived from 
the mock catalogs for both Dim and Bright samples, and find that at 50 < s < 100 ft Mpc the vari- 
ance is consistent with Eq. IB1I At smaller scales non-Gaussian contributions to the errors quickly 
make Eq. (|Bip an understimate of the uncertainties, whereas at scales s > 100/t Mpc contribu- 
tions to Eq. (|Bip from edge effects (not included there) become important, e.g. making the total 
error about 20% larger than given by Eq. dHO at 300 /r x Mpc. Since edge effects depend only on 
the geometry and density of the sample, we can extract their value from this comparison and apply 
it to the data, which has the same characteristics (but different two-point function). 

From the measured two-point function in the data we use Eq. (|B1|) plus the edge effects variance 
to compute Gaussian errors for a model that matches the observed £. We find uncertainties that are 
very close to those in the mock catalogs at s > 150 /i _1 Mpc, the main reason being that discreteness 
contributions are significant. This suggests that had we changed cosmology and HOD to give a 
better fit to £ at largest scales, we would have gotten very similar uncertainties, and thus similar 
~ 2a deviations away from zero-crossing for the Bright sample. 
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Fig. 1. — Angular Selection Function : SDSS DR7 LRG sky coverage. For plotting purposes we 
present one tenth of the 105, 831 DR7-Full galaxies. The solid blue line is the Galactic plane. 
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Fig. 2. — Radial selection function: Comoving number density n(z) of the full DR7 (DR7-Full; 
black) and its subsamples DR7-Dim (green), DR7-Bright (blue), DR7-Bright2 (red) and DR3 
(cyan). 
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Fig. 3. — DR3 and DR7 Our DR3 results (green diamond s ) show excellent agreement on the 
scales investigated here with those published by lEisenstein et al.1 (|2005l ) (red crosses and uncertainty 
bars). The remaining discrepancies are consistent with shot-noise in the random catalogs. The 
dashed lines are our results for DR7-Full which shows a stronger clustering signal at 135 < s < 
180/i _1 Mpc . In the thick blue dashed line we used the same £Imo = 0.3 flat cosmology as the 
DR3 results, and thin black dashed line FImo = 0.25. Both cosmologies agree very well at scales 
discussed here 80 < s < 200/i~ 1 Mpc. In our DR3 result we use number ratio of random to data 
points r ~ 50 and for DR7 r ~ 15.6. 
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Fig. 4. — DR7 Subsample £(s): Comparing DR7-Full (thick dashed line) to subsamples DR7-Dim 
(green diamonds), DR7-Bright (blue crosses) and DR7-Bright2 (red squares). DR7-Bright shows 
a stronger signal than the other samples on most scales. The peak position appears consistent for 
all subsamples. For the uncertainties of DR7-Dim and DR7-Bright please refer to Figures [5] and 
respectively. 
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Fig. 5. — DR7-Dim £(s): Results from SDSS (black diamonds) and from the LasDamas mock catalogs. 
The mock mean £ moc j. is the green solid line and the uncertainties in the mean are small vertical green lines. 
The variance for one realization is presented by the gray bands: 68.2% light gray (la moc k), and 95.4% by 
dark (2a moc k)- The blue dotted lines arc the outermost result of all mocks in each separation bin (not one 
realization in particular). Top Inset: Same format as main figure extending the results with wider bins to 
larger scales. Bottom Inset: Significance of large-scale clustering - we average £ in bins [130, 400] /i -1 Mpc 
. The observed (£) (red thick dashed line) is clearly within 2a of the mock realizations (black histogram) , 
with a x 2 fitting on 10 d.o.f yielding \ 2 /d.o.f = 0.72. The mock mean result is the thin green dot-dashed 
line. 39 




Fig. 6. — DR7-Dim £(s) Mock Realizations: Hand-picked LasDamas mocks (lines) compared with 
the SDSS result (diamonds). The green (dotted) and red (solid) show fairly good agreement on 
most large-scales with observation, though their peak position appear to be in different locations. 
The orange (dashed) and cyan (dot-dashed) mocks show example realizations without a baryonic 
acoustic feature. Using a liberal approach, we counted a minimum bound of 10% (17) from the 
full set of 160 mocks with no sign of a peak. 
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Fig. 7. — Relative Bias: Comparing redshift-space bias ratio bom-Bright /bDR7-Dim = 
\J £,DR7 -Bright / £,DR7-Dim between the two subsamples for both the observed LRGs (symbols) as 
well as the LasDamas mock mean (dotted lines). The average observed value is taken between 
— 120/i _1 Mpc indicated in dashed lines valued at 1.15. 
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Fig. 8. — DR7-Bright £(s): Same format as Figure © for the brighter sample. The insets here 
extend to 500 h M.pc. The observed subsample shows a significantly stronger signal at large scales 
than produced by our ACDM+HOD model. 
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Fig. 9. — Baryonic Acoustic Peak Position in DR7-Dim mocks: Top Panel- The line for the 75 
clear-peaked mocks £75 (bright green dashed) shows a more pridominant peak than that of the 
full catalog £ieo (solid black). Both means show roughly the same peak position, and width. The 
vertical lines are la uncertainties of the mean. Bottom Panel- Residual £ 75 — £ 160 as function of 
scale. 
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Fig. 10. — Measuring the Baryonic Acoustic Peak Position s p in DR7-Dim Sample: Top Left 
Panel- Histogram of 75 mock realizations with clear sign of a peak. Vertical red (thick dark) solid 
line is observed value and green (thin bright) is that of mock mean. a± =239 /i _1 Mpc are the la 
CL when counting 68.2% of the mock peaks around the mock mean 107.88 /i _1 Mpc, normalized to 
that of the observation by a ~ £. Top Right Panel- x 2 histogram for mock realization fits to model, 
where the red line indicates that of the observation at x 2 = 23.6 for 19 d.o.f. Bottom Right Panel- 
Fitting the observed £(s) (black diamonds) to model /?£(« • s) (blue line), where £ is the mock 
mean. Red arrow indicates s p = 103.6 /i _1 Mpc ami is the same as in the top left panel. Indicated 
also are best-fit a and f3 parameters. Black vertical lines are y/Cu. Bottom Left Panel- Cij/((Ti<jj). 
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Fig. 11.— r s (z d )/D v (z) Result: We obtain r s /D v (z = 0.278) of 0.1394± 0. 0049 (black diamond ; 
la uncertainty) in good agr eement with the z = 0.275 result pres ented by iPercival et al.1 ((2009) 



(red crosses) . Othe r results ( Sanchez et al. 2009: purple triangles, Reid et al. 20091 : 



cyan crosses, 

Percival et alj|2007l : orange crosses: lEisenstein et alj|2005l : blue square) are indicated. These points 



are not all independent as they use the same sample. The solid lines show predictions of various 

flat ACDM cosmologies constraining Qmo^ 2 = 0.1358 and varying Qmo an d h, where the top (blue) 

line is Omo = 0.19, h = 0.84 and the bottom (orange) line is Qmo = 0.33, h = 0.64. Intermediate 
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steps shown are CImo = 0.22,0.25,0.29. Our result clearly agrees with ranges Qmo = [0.25,0.33] 
and h = [0.64,0.73]. 
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Fig. 12. — £(s c ) = 0: Here we show all 160 DR7-Dim mock realizations. Those with a crossover 
point s c (crossover marked in vertical green lines) are blue solid lines, and the 4% without are in 
thick red. The mean value is the solid white line, and its s c ~ 140 /i _1 Mpc is indicated by the 
vertical orange line. In the inset we show a histogram of all s c values, where the dotted orange line 
is the mean value. 
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Fig. 13. — n{z) effect on £ and in DR7-Dim: Left inset shows n{z) of LasDamas mock catalogs 

(light cyan histogram; n(z)i) which has a slight negative slope, and that observed in the SDSS 

sample (thick green histogram, same as in Figure [2j n(z)sDSs)- We exclude 15% of the mock 

galaxies in each realizatoin to match ra(z)sDSS- The result is the black histogram (71(2)2). Ln the 

main plot the mean £ over 160 mock DR7-Dim realizations for the different selection functions. 

Cyan diamonds are results of n(z)i and the black crosses are for n{z)2- Both £(s) are radially 

weighted with their respective n(z), and are in very good agreement. Right inset shows the same 
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for the r.m.s a^, indicating a slightly stronger variance for the latter sample, as it has slightly higher 
Poisson shot-noise. 
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Fig. 14. — Random point distribution effects on £ in DR7-Dim: Results for average over 8 mock 
realizations when using original mock n(z)i (Figure fl~3j) . In each realization we distribute radial 
distances to random points differently: diamonds- data point redshifts distributed randomly to 
random catalog (Data-Redshift), crosses- random n(z) matches data but otherwise independent 
redshift distribution (Data-n(z)). Radial weighting schemes: Black- no weighting , Blue- both data 
and random are weighted radially in same manner (shifted to right by 2ft. _1 Mpc for clarity), Top 
Panel- To clarify differences we plot s ■ £(s). Bottom Panel- Ratio of each case £j to unweighted 
Data-Redshift (£q)- We clearly see that Data-R^eclshift (diamonds) yield the lowest result, hence 
diminishing the radial clustering mode. This is noticeable in range to 50 < s < 130/i _1 Mpc after 
which results agree with most of the other options. Weighting causes no effect in this case as the 
n(z) used is close to volume-limited. 
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Fig. 15. — Random Shot-Noise: We run the same algorithm on the observed DR7-Dim, using five 
different random catalogs, each in turn. The £ results are given in the top panel, where the five 
random sets are represented by: black-dotted, green-dashed, red-dot-short-dashed, orange-long- 
dashed, blue-triple-dotted-dashed lines. To facilitate differences the bottom inset shows the ratio 
of each i th catalog in respect to the first random catalog xi\. The inset has the same format as 
the bottom panel for smaller sacles. The bottom arrows in both panels pin-point the peak position 
found, according to our algorithm. We see that four random catalogs result in same s D within 
1 /^Mpc where a fifth is 3 ^Mpc larger than^e smallest obtained value. These results are for 
a ratio of random to data points of r =Nr/Nd ~ 15.6. To reduce this effect in our main analysis 
we use r ~ 50 (its result is marked by top arrow at s p = 103.6 /i _1 Mpc). 
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Fig. 16. — Weighting Effects on £: We compare different weighting schemes on the observed DR7- 
Full. wd and wr indicate the weight used on data and random points respectively, where black 
(diamonds) indicates no weight used (w = 1), cyan (triangles) indicates radial weight (Radial; 
shifted by 1.75/i _1 Mpc for clarity) and red (crosses) indicates radial+ (fiber-collision correction) 
(Fiber-Collision+Radial; shifted by 35/i~ 1 Mpc ). All options take into account sector completeness 
corrections. For clarity we plot in the top panel s-£(s) and on the bottom the ratio of each weighting 
option & over the first (wd = wr = 1; Co)- 
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Fig. 17. — 6 Estimato rs at Large Scal es of DR7-Dim : Comparing e stimators proposed by LS93 
(|Landv fc Szalavlll99.4 black), HAM93 (|Hamiltonl[l993l: green), D P83 (bavis fc Peebleslll983l : blue, 
symbols shifted for clarity) and PH74 ([Peebles &: Hauserlll974l ; red, symbols shifted). The lines 
are mock mean values, and the symbols are those observed. The uncertainty bars on the symbols 
are the r.m.s of the mocks. Although the observed results appear in reverse order than the mock 
mean, we verified by looking at particular mock realizations that some are in this order as well. 
Inset: shows the Noise to Signal u^/i- The strong spikes are due to sensitivity at the signal zero 
crossing. Refer to Figure [18] for cr^(s) of the different estimators. We find that LS93 and HAM93 
agree very well on large scales, and asymptote quicker to zero than the other two. The ratio of 
random to data points used for SDSS is r ~ 15.6 and for mocks r ~ 10. 




Fig. 18. — Estimator Variance: Main Plot: Same as Fig. [T71 showing the r.m.s <r^ = \JCa where Cij 
is the covariance of £. PH74 and DP83 show poor variance performance on large-scales. HAM93 
agrees very well with LS 93 on large scale s but has a larger variation on < 10 /i _1 Mpc. Inset: 
Comparing estimators to lLandv Szalavl (|1993l ). The ratio of random to data points used for 
SDSS is r ~ 15.6 and for mocks r ~ 10. 
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Fig. 19. — LRG and random point statistics up to s = 800 /i _1 Mpc. a) signal r.m.s for DR7-Dim 
(black solid line) and DR7-Bright-Near (green triple-dot-dashed) . b) Normalized data-data pairs 
DD for DR7-Dim and DR7-Bright-Near (same notation) and random-random of DR7-Dim (blue 
dot-dashed) c) Same as (b) only derivative dx/ds. where x =DD,RR. d) Examining boundary 
effects in pair counting. A periodic box would yield dln(RR) / dln(s) = 2. Hence deviations from 2 
(dotted line) are effects of boundary, e) Counting the number of cubes of length s fit in the survey 
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Calculations of DR3 performed using Qmo = 0.3. 



